Assessment of Data Quality of Selected Data Sets in the Department of Energy/comprehensive Nuclear Test-ban- Treaty Knowledge Base

نویسندگان

  • D. N. Hagedorn
  • C. A. LoPresti
  • R. F. O’Brien
  • S. A. Hartley
  • B. G. Amidan
چکیده

The U.S. Department of Energy’s (DOE) Comprehensive Nuclear Test-Ban-Treaty (CTBT) Knowledge Base (KB) contains detailed regional data, from which corrections to seismic, hydroacoustic, and infrasonic signals will be generated. As the KB is populated with information and data sets detailing regional geological and geophysical structures and reference event data, questions of “How good is the information?” and “What confidence can I have in the corrections?” arise. This report documents work todate at the Pacific Northwest National Laboratory on the development of a “toolbox” of statistically-based algorithms which may be used to assess the quality of individual data sets, and consistency across multiple data sets, both on data in the KB and prior to including new data in the KB. Thirteen data sets (consisting of metadata, header, projection, and data files), supplied by Sandia National Laboratories, from the KB were used in this preliminary examination. The metadata files were reviewed before analysis of the data sets began. We noted that some fields were not filled in, others had very brief entries, while yet others were quite complete and informative. Comparing metadata files across data sets, we noted that the quality of the information was not consistent, there were problems with the accuracy/ precision of the numerical data, processing audit trails were poor to nonexistent, and when compared to the headers in the data files, some discrepancies were noted. Several methods were employed to evaluate the individual data sets for spurious data. We believe that because these gridded data sets in the KB are composites created from multiple sources and have been processed and smoothed, no outlier data was found. We did discover in some of the data sets that areas of constant value (algorithm default values) existed, which are not geologically reasonable. We believe that these areas were created as a result of the processing and are not valid data. Problems with agreement between data sets were also identified. Comparing data sets was problematic because of the different grid sizes and cell locations. As an example of what can be done to evaluate agreement between data sets, we examined three Mohorovicic (Moho) Discontinuity depth maps, which contained regions in common to two or all three data sets. Depths to, and trends in, the Moho in the data sets did not agree with each other, and in some instances depths at the same location had differences of up to 20 km. These are all serious problems, which must be rectified prior to using this KB data to generate seismic corrections. The effects of the noted discrepancies on the corrections have not yet been assessed, but we believe that corrections derived from using the different data sets would be significantly different. The cumulative effects of multiple errors could be more drastic. Certain needs for the overall KB were also identified, such as a need for well-defined criteria for accuracy of data, estimates of uncertainty to be associated with individual data, a consistent schema for gridding and combining data sets, and tolerance limits for agreement between data sets. Treatment of uncertainty in the data and understanding the effects of that error on the correction estimates that result from using the KB data is essential. The data sets from the KB used in this study are not raw data. They have been generated from multiple sources, processed, interpolated, and smoothed, and have no estimates of error or uncertainty. Uncertainty estimates cannot be confidently derived from processed and smoothed data, as from raw data. Any estimates of error and uncertainty will need to be inferred from ground-truth events.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quality Assessment of Research Articles in Nuclear Medicine Using STARD and QUADAS-2 Tools

Objective(s): Diagnostic nuclear medicine is being increasingly employed in clinical practice with the advent of new technologies and radiopharmaceuticals. The report of the prevalence of a certain disease is important for assessing the quality of that article. Therefore, this study was performed to evaluate the quality of published nuclear medicine articles and determine the frequency of repor...

متن کامل

A Knowledge-Based System to Support Nuclear Test Ban Treaty Verification

The major technical obstacle to the signing of nuclear test ban treaties is the issue of verifying compliance. Since the banning of atmospheric and oceanic testing (Limited Test Ban Treaty of 1963) pushed testing underground, seismic monitoring has been one of the most important technologies available for monitoring compliance with test ban treaties. The goal of these treaties is to progressive...

متن کامل

Machine learning for radioxenon event classification for the Comprehensive Nuclear-Test-Ban Treaty.

A method of weapon detection for the Comprehensive nuclear-Test-Ban-Treaty (CTBT) consists of monitoring the amount of radioxenon in the atmosphere by measuring and sampling the activity concentration of (131m)Xe, (133)Xe, (133m)Xe, and (135)Xe by radionuclide monitoring. Several explosion samples were simulated based on real data since the measured data of this type is quite rare. These data s...

متن کامل

Seismological Methods of Monitoring Compliance with the Comprehensive Nuclear-Test-Ban Treaty

Seismology provides the key technology for monitoring the occurrence of underground nuclear explosions. A new International Monitoring System (IMS) which is being established under the provisions of the Comprehensive Nuclear-Test-Ban Treaty has begun its work in conjunction with a new International Data Centre in Vienna, and this work is likely to continue to develop even though prospects for e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999